Query Reranking As A Service
نویسندگان
چکیده
Many web databases are “hidden” behind proprietary search interfaces that enforce the top-k output constraint, i.e., each query returns at most k of all matching tuples, preferentially selected and returned according to a proprietary ranking function. In this paper, we initiate research into the novel problem of skyline discovery over top-k hidden web databases. Since skyline tuples provide critical insights into the database and include the top-ranked tuple for every possible ranking function following the monotonic order of attribute values, skyline discovery from a hidden web database can enable a wide variety of innovative third-party applications over one or multiple web databases. Our research in the paper shows that the critical factor affecting the cost of skyline discovery is the type of search interface controls provided by the website. As such, we develop efficient algorithms for three most popular types, i.e., one-ended range, free range and point predicates, and then combine them to support web databases that feature a mixture of these types. Rigorous theoretical analysis and extensive real-world online and offline experiments demonstrate the effectiveness of our proposed techniques and their superiority over baseline solutions.
منابع مشابه
Submodular Reranking with Multiple Feature Modalities for Image Retrieval
We propose a submodular reranking algorithm to boost image retrieval performance based on multiple ranked lists obtained from multiple modalities in an unsupervised manner. We formulate the reranking problem as maximizing a submodular and non-decreasing objective function that consists of an information gain term and a relative ranking consistency term. The information gain term exploits relati...
متن کاملStudy Of Multidomain Query Optimization And Answering
In queries having multiple domains it is seen that general purpose search engines are not able to answer multidomain queries and one of the domain is considered by specific search services but no integrated framework is obtainable. Queries which are answerable by combining knowledge from two or more domains are multidomain queries. This paper presents an overall view for multidomain queries on ...
متن کاملDEU at ImageCLEF 2009 WikipediaMM Task: Experiments with Expansion and Reranking Approaches
This paper describes participation of Dokuz Eylül University to WikipediaMM task at ImageCLEF2009. This year we concentrated on two main topics: First is about expansion of native document, term phrase selection and query expansion processes which is based on WordNet, WSD and WordNet similarity functions. The second is a new reranking approach with Boolean retrieval and CM based clustering. Exp...
متن کاملReranking Medline Citations by Relevance to a Difficult Biological Query
We have initialized research aimed at automatically extracting Medline citations of biomedical articles and reranking them according to their relevance to a certain biomedical property difficult to express as PubMed query. Our proposed approach to this problem is to train support vector machines as classifiers able to distinguish relevant citations from the rest of retrieved citations. We used ...
متن کاملMultimodal Image Retrieval over a Large Database
We introduce a new multimodal retrieval technique which combines query reformulation and visual image reranking in order to deal with results sparsity and imprecision, respectively. Textual queries are reformulated using Wikipedia knowledge and results are then reordered using a k-NN based reranking method. We compare textual and multimodal retrieval and show that introducing visual reranking r...
متن کاملUniversity of Hagen at CLEF2006: Reranking Documents for the Domain-specific Task
This paper describes the participation of the IICS group at the domain-specific task (GIRT) of the CLEF campaign 2006. The focus of our retrieval experiments is on trying to increase precision by reranking documents in an initial result set. The reranking method is based on antagonistic terms, i.e. terms with a semantics different from the terms in a query, for example antonyms or cohyponyms of...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- PVLDB
دوره 9 شماره
صفحات -
تاریخ انتشار 2016